Disambiguating Personal Names on the Web Using Automatically Extracted Key Phrases
Abstract
When you search the web for information about a particular person, a search engine returns many pages, some of which may be about other people with the same name. How can we disambiguate these people? This paper presents an unsupervised algorithm that produces unique phrases to disambiguate different people with the same name (i.e., namesakes). The algorithm takes a personal name as input and outputs multiple sets of phrases that uniquely identify the different namesakes on the web. These phrases can then be added to the query to narrow the search to a specific namesake. We evaluated the algorithm on a collection of documents retrieved from the web. Experimental results show a significant improvement over existing methods proposed for this task.
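To illustrate the last step only (adding extracted phrases to the query), here is a minimal sketch. It does not implement the paper's extraction algorithm. The function name `refine_queries`, the example name, and the example phrase sets are all hypothetical and chosen purely for illustration.

```python
def refine_queries(name, phrase_sets):
    """Build one refined search query per namesake.

    `phrase_sets` is assumed to hold one list of key phrases per namesake,
    as an extraction algorithm might output. Each term is wrapped in
    double quotes so a search engine treats it as an exact phrase.
    """
    queries = []
    for phrases in phrase_sets:
        terms = [f'"{name}"'] + [f'"{phrase}"' for phrase in phrases]
        queries.append(" ".join(terms))
    return queries


# Hypothetical phrase sets for two namesakes (illustrative data, not real results).
example_sets = [["machine learning professor"], ["jazz pianist"]]
queries = refine_queries("John Smith", example_sets)
for query in queries:
    print(query)
```

Each resulting query pairs the ambiguous name with the phrases for one namesake, so submitting it should favor pages about that person.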
Similar resources
Identifying People on the Web through Automatically Extracted Key Phrases
Assume that we are looking for information about a particular person. A search engine returns many pages for that person’s name. Some of these pages may be about other people with the same name. How can we distinguish the results for the person we are interested in from the others? A simple but effective solution is to add a phrase to the query that uniquely identifies the person we are inter...
Exploring Key Phrases for Browsing an Online News Feed in a Mobile Context
This paper describes ongoing work on how to automatically identify and use key phrases extracted from items of a news feed available on the Internet. These phrases are used for two different tasks: users of mobile devices (e.g., cellular phones and personal digital assistants) will be able to subscribe to news in different categories, where the categorisation of the news is based on the extract...
Semantic Search: from Names and Phrases to Entities and Relations
Web search is traditionally limited to keyword queries. In the era of Big Data and the Web of Linked Data, one would expect that schema-free search over both text and structured key-value pairs becomes more semantic. Systems should, for example, identify entities in queries and return crisp answers referring to facts, other entities and relationships. Some of these desired advances are happenin...
Extracting Key Phrases to Disambiguate Personal Names on the Web
When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. How can we disambiguate these different people with the same name? This paper presents an unsupervised algorithm which produces key phrases for the different people with the same name. These key phrases could be used to further n...
Automatically Extracting Personal Name Aliases from the Web
An entity can be referred to by multiple name aliases on the web. Extracting aliases of an entity is important for various tasks such as identification of relations among entities, automatic metadata extraction and entity disambiguation. To extract relations among entities properly, one must first identify those entities. Aliases of an entity are useful as metadata for that entity and can be used ...